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Abstract 


Arabinogalactan-proteins (AGPs) are highly glycosylated proteins (glycoproteins) found in the cell walls of plants. 
AGPs account for only a small portion of the cell wall, usually no more than 1% of dry mass of the primary wall. 
AGPs are members of the hydroxyproline-rich glycoprotein (HRGP) superfamily that represent a large and diverse 
group of glycosylated wall proteins. AGPs have attracted considerable attention due to their highly complex struc- 
tures and potential roles in signaling. In addition, they have industrial and health applications due to their chemi- 
cal/physical properties (water-holding, adhesion and emulsification). Glycosylation can account for more than 
90% of the total mass. AGPs have been reported in a wide range of higher plants in seeds, roots, stems, leaves and 
inflorescences. They have also been reported in secretions of cell culture medium of root, leaf, endosperm and 
embryo tissues, and some exudate producing cell types such as stylar canal cells are capable of producing lavish 


amounts of AGPs. 
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AGP protein backbones and classifi- 
cation 


The protein component of AGPs is rich in the amino ac- 
ids Proline (P), Alanine (A), Serine (S) and Threonine (T), 
also known as ‘PAST’, and this amino acid bias is one of 
the features used to identify them. WIZIGIGMIF] AGPs are 
intrinsically disordered proteins as they contain a high 
proportion of disordering amino acids such as Proline 
that disrupt the formation of stable folded structures. 
Characteristic of intrinsically disordered proteins, AGPs 
also contain repeat motifs and post-translational modi- 
fications. “© Proline residues in the protein backbone 
can be hydroxylated to Hydroxyproline (O) depending 
on the surrounding amino acids. The ‘Hyp contiguity 
hypothesis’!!! predicts that when O occurs in a non- 
contiguous manner, for example the sequence 'SOTO', 
such as occurs in AGPs, this acts as a signal for O-linked 
glycosylation of large branched type Il arabinogalactan 
(AG) polysaccharides. !! Sequences that direct AG gly- 
cosylation (SO, TO, AO, VO) are called AGP glycomotifs 
(Figure 1). 
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All AGP protein backbones contain a minimum of 3 clus- 
tered AGP glycomotifs and an N-terminal signal pep- 
tide that directs the protein into the endoplasmic retic- 
ulum (ER) where post-translational modifications begin 
9] Prolyl hydroxylation of P to O is fulfilled by prolyl 4- 
hydroxylases (P4Hs) belonging to the 2-oxoglutarate 
dependant dioxygenase family. "©! P4H has been iden- 
tified in both the ER and Golgi apparatus (GA). ""! The 
addition of the glycosylphosphatidylinositol (GPI)- 
anchor occurs in most but not all AGPs. @I4) 


AGP family of glycoproteins 


AGPs belong to large multigene families and are di- 
vided into several sub-groups depending on the pre- 
dicted protein sequence. “2#I03I04IZIGI05] "Classical" 
AGPs include the GPI-AGPs that consist of a signal pep- 
tide at the N-terminus, a PAST-rich sequence of 100- 
150 aa and a hydrophobic region at the C-terminus that 
directs addition of a GPl-anchor; non GPI-AGPs that 
lack the C-terminal GPI signal sequence, Lysine(K)-rich 
AGPs that contain a K-rich region within the PAST-rich 
backbone and AG-peptide that have a short PAST-rich 
backbone of 10-15 aa (Figure 2). Chimeric AGPs consist 
of proteins that have an AGP region and an additional 
region with a recognised protein family (Pfam) domain. 
Chimeric AGPs include fasciclin-like AGPs (FLAs), phy- 
tocyanin-like AGPs (PAGs/PLAs, also known as early- 
nodulin-like proteins, ENODLs) and xylogen-like AGPs 
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(XYLPs) that contain lipid-transfer-like domains. ™ Sev- 
eral other putative chimeric AGP classes have been 
identified that include AG glycomotifs associated with 
protein kinase, leucine-rich repeat, X8, FH2 and other 
protein family domains. “I671 Other non-classical 
AGPs exist such as those containing a cysteine(C)-rich 
domain, also called PAC domains, and/or histidine(H)- 
rich domain, "®!!9! as well as many hybrid HRGPs that 
have motifs characteristic of AGPs and other HRGP 
members, usually extensin and Tyr motifs. @8I@0MIZIC] 
AGPs are evolutionarily ancient and have been identi- 
fied in green algae as well as Chromista and Glauco- 
phyta. #2124 Found throughout the entire plant line- 
age, land plants are suggested to have inherited and di- 
versified the existing AGP protein backbone genes pre- 
sent in algae to generate an enormous number of AGP 
glycoforms. 
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majority of which are bio-chemically uncharacterized. 
The GT31 family is one of the families involved in AGP 
glycan backbone biosynthesis. '°!?7] Numerous mem- 
bers of the GT31 family have been identified with Hyp- 
O-GALT activity . 78°] and the core 6-(1,3)-galactan 
backbone is also likely to be synthesized by the GT31 
family. "71 Members of the GT14 family are implicated 
in adding 6-(1,6)- and 6-(1,3)-galactans to AGPs. 2°16 
In Arabidopsis, terminal sugars such as fucose are pro- 
posed to be added by AtFUT4 (a fucosyl transferase) 
and AtFUT6 in the GT37 family °2!2] and the terminal 
GlcA incorporation can be catalysed by the GT14 fam- 
ily. Pl] A number of GTs remain to be identified, for 
example those responsible for terminal Rha. 


Bioinformatic analysis predicts the addition of a GPI- 
anchor on many AGPs."“! The early synthesis of the GPI 
moiety occurs on the ER cytoplasmic surface and sub- 
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Figure 1 | Protein sequence of a classical, GPl-anchored AGP3 from Arabidopsis thaliana. Features directing post-translational 
modification are highlighted: ER signal sequence highlighted purple, AGP glycomotifs highlighted blue and GPI signal sequence 


boxed. CC BY-SA 4.0 


AGP biosynthesis 


After translation, the AGP protein backbones are highly 
decorated with complex carbohydrates, primarily type 
Il AG polysaccharides. 2! The biosynthesis of the ma- 
ture AGP involves cleavage of the signal peptide at the 
N-terminus, hydroxylation on the P residues and subse- 
quent glycosylation and in many cases addition of a 
GPl-anchor. 


The structure of the AG glycans consists of a backbone 
of 6-1,3 linked galactose (Gal), with sidechains of 6-1,6 
linked Gal and have terminal residues of arabinose 
(Ara), rhamnose (Rha), Gal, fucose (Fuc), and glucuronic 
acid (GlcA). Glycosylation of the AGP backbone is sug- 
gested to initiate in the ER with the addition of first Gal 
by O-galactosyltransferase, which is predominantly lo- 
cated in ER fractions. 7! Chain extension then occurs 
primarily in the GA. “! The AG glycan moiety of AGPs 
is assembled by glycosyltransferases (GTs). °! O-glyco- 
sylation of AGPs is initiated by the action of Hyp-O-ga- 
lactosyltransferases (Hyp-O-GalTs) that add the first 
Gal onto the protein. The complex glycan structures are 
then elaborated by a suite of glycosyltransferases, the 
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sequent assembly take place in the lumen of the ER. 
These include the assembly of tri-mannose (Man), ga- 
lactose, non-N-acetylated glucosamine (GIcN) and eth- 
anolamine phosphate to form the mature GPI moi- 
ety.51861 AGPs undergo GPl-anchor addition while co- 
translationally migrating into the ER and these two pro- 
cesses finally converge. Subsequently, a transamidase 
complex simultaneously cleaves the core protein at the 
C-terminus when it recognizes the w cleavage site and 
transfers the fully assembled GPl-anchor onto the 
amino acid residue at the C-terminus of the protein. 
These events occur prior to prolyl hydroxylation and 
glycosylation.2”""4] The core glycan structure of GPI an- 
chors is Man-a-1,2-Man-a-1,6-Man-a-1,4-GIcN-inositol 
(Man: mannose, GIcN: glucosaminyl), which is con- 
served in many eukaryotes. COF8IF912510N01 The only 
plant GPI anchor structure characterized to date is the 
GPl-anchored AGP from Pyrus communis suspension- 
cultured cells.°°! This showed a partially modified gly- 
can moiety compared to previously characterized GPI 
anchors as it contained 6-1,4-Gal. The GPI anchor syn- 
thesis and protein assembly pathway is proposed to be 
conserved in mammals and plants." The integration of 
a GPI-anchor enables the attachment of the protein to 
the membrane of the ER transiting to the GA leading to 
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secretion to the outer leaflet of the plasma membrane 
facing the wall.) As proposed by Oxley and Bacic,?°! 
the GPl-anchored AGPs are likely released via cleavage 
by some phospholipases (PLs) (C or D) and secreted into 
the extracellular compartment. 


AGPs functional roles 


Human uses of AGPs include the use of Gum arabic in 
the food and pharmaceutical industries because of nat- 
ural properties in thickening and emulsification.) 
AGPs in cereal grains have potential applications in bio- 
fortification,!““! as sources of dietary fibre to support gut 
bacteria*! and protective agents against ethanol tox- 
icity (461 


AGPs are found in a wide range of plant tissues, in se- 
cretions of cell culture medium of root, leaf, endosperm 
and embryo tissues, and some exudate producing cell 
types such as stylar canal cells.7°! 7] AGPs have been 
shown to regulate many aspects of plant growth and 
development including male-female recognition in re- 
production organs, cell division and differentiation in 
embryo and post-embryo development, seed mucilage 
cell wall development, root salt tolerance and root-mi- 
crobe interactions (see Table 1).5!448] These studies 
suggest that they are multifunctional, similar to what is 
found in mammalian —proteoglycans/glycopro- 
teins.!4915°151] Conventional methods to study functions 
of AGPs include the use of 6-glycosyl (usually glucosyl) 
Yariv reagents and monoclonal antibodies (mAbs). 6- 
Glycosyl Yariv reagents are synthetic phenylazo glyco- 
side probes that specifically, but not covalently, bind to 
AGPs and can be used to precipitate AGPs from solu- 
tion.2! They are also used commonly as histochemical 
stains to probe the locations and distribution of 
AGPs,. 21541 A number of studies have shown that addi- 
tion of 6-Yariv reagents to plant growth medium can in- 
hibit seedling growth, cell elongation, block somatic 
embryogenesis and fresh cell wall mass accumula- 
tion.O°1621671 The use of mAbs that specifically bind to 
carbohydrate epitopes of AGPs have also been em- 
ployed to infer functions based on the location and pat- 
tern of the AGP epitopes.“®! Commonly used mAb 
against AGPs include CCRC-M7, LM2, JIM8, JIM13 and 
JIM14,59! 


The function of individual AGPs has largely been in- 
ferred through studies of mutants. For example, the Ar- 
abidopsis root-specific AtAGP30 was shown to be re- 
quired for in vitro root regeneration suggesting a func- 
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tion in regenerating the root by modulating phytohor- 
mone activity.'©° Studies of agp6 and agp11 mutants in 
Classical AGP 


Hybrid AGP 


Chimeric AGP 


i} AGP glycomotif 


(ae | ER signal 
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Figure 2 | Schematic of the predicted structures of se- 
lected AGP sub-families. Classical AGPs contain glyco- 
motifs directing hydroxylation of P to O and subsequent 
O-glycosylation to which large type II arabinogalactan 
chains (Type Il AG, yellow shapes) are attached. Many 
AGPs are predicted to contain a GPl-anchor at the C- 
terminus. Hybrid AGPs contain motifs characteristic of 
more than one HRGP family, for example glycomotifs 
typical of extensins (SP3-s; red bars) that direct addition 
of short arabinose (Ara) side chains (dark red) on Hyp 
and galactose (Gal; green) on Ser residues. Cross-link- 
ing Tyr motifs (dark blue bars) may also be present in 
the protein backbone. Chimeric AGPs have a recog- 
nised PFAM domain (green) in addition to the AGP re- 
gion. 
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Arabidopsis have demonstrated the importance of 
these AGPs to prevent uncontrolled generation of the 
pollen-grain and for normal growth of the pollen- 
tube.!64l6] The functional mechanisms of AGPs in cell 
signalling is not well understood. One proposed model 
suggests AGPs can interact and control the release of 
calcium from AG glycan (via GlcA residues) to trigger 
downstream signalling pathways mediated by cal- 
cium. 2141651 Another possible mechanism, largely 
based on the study of FLAs, suggests the combination 
of fascicilin domain and AG glycans can mediate cell- 
cell adhesion.!$7 Functions attributed to AGPs are 
outlined in Table 1. 
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Table 1| Summary of proposed functions of AGPs in plant growth and development. 


; ; Sn ; ; 

Biological role AGP Fallb Location(s) Function(s) meretances 

Embryogenesis GhPLA1 Somatic embryos Promoting somatic embryogenesis a8 
DcAGPs Somatic embryos Promoting somatic embryogenesis eA 
AtAGPs Embryos Embryo development and differentiation 70, 
NtAGPs Embryos Embryo development 71 
BgAGPs Somatic embryos Somatic embryo development rate and morphology 72 
BnAGPs Embryos Embryo development 54 
MaAGPs Somatic embryos Promoting somatic embryogenesis 73 
PsAGPs Promoting somatic embryogenesis 7h 
FsAGPs Embryos Establishment and stability of the cell wall 21 
VcALGAL-CAM embryos Embryo cell adhesion 66 
VclSG embryos Embryo inversion 75 

Reproduction AtAGP4 (JAGGER) Pistil Pollen tube blockage zi 
AtAGP6, AtAGP11 Stamen, pollen grain and pollen tube Pollen grain development and pollen tube growth 778) 
AtAGP18 Ovule Megaspore selection 79(80] 
AtFLA3 Pollen grain and pollen tube Microspore development 81 
AtENODL11-15 Micropylar Pollen tube reception 82][83] 
BcmMF8 Pollen grain and pollen tube Pollen wall development and pollen tube growth 84 
BcmMF18 Pollen grain Pollen grain development, intine formation 85 
NtTTS Pistil Pollen tube growth and guidance 86 
Np/Na120kD Pistil S-specific pollen rejection (self-incompatibility) 87 
OsMTR1 Male reproductive cells Anther development and pollen fertility 88 

Plant development AtAGP19 IStem, flower, root and leaf Cell division and expansion, leaf development and reproduction '°9 
AtAGP57C Rosette leaf, silique, seed, flower, and shoot apex of inflorescence stem Cell wall structure maintenance 20) 
AtFLA1 Stomata, trichome, leaf vasculature, primary root tip and lateral root ~— Lateral root development and shoot regeneration 31 
AtFLAg (SOSSs) Flower, leaf, stem, root, silique Root salt stress tolerance; seed mucilage adherence 921(931[94][95]196] 
PpAGP1 Apical cells Apical cell expansion 97 
AtAGP30 Root Root regeneration and seed germination 60 
BcrFLA1 Root Root hair elongation 98 


Secondary wall 
development 


AtFLAa11, AtFLA12 
AtXYP1, AtXYP2 


Stem and branch 


Cell walls of differentiating tracheary elements 


Secondary cell wall synthesis/patterning 
Vascular tissue development and patterning 


GhAGP4 Cotton fiber Cotton fiber initiation and elongation 81 

GhFLA1 Cotton fiber Fiber initiation and elongation 14 

PtFLA6 Stem xylem fiber Secondary cell wall synthesis/patterning 101) 
Defense SlattAGP Site of parasite attack Promotes parasite adherence f202] 
Plant-microbe AtAGP17 Root Agrobacterium tumefaciens root transformation [203] 
interaction 


5 of 9 | WikiJournal of Science 


8 © 


WikiJournal of Science, 2021, 4(1):5 
doi: 10.15347/wjs/2021.10.15347/WJS/2021.002 
Encyclopedic Review Article 


fl Gh: w:Gossypium hirsutum, Dc: w:Daucus carota, At: w:Arabidopsis thaliana, Nt: w:Nicotiana tabacum, Bg: 


w:Bactris gasipaes, Bn: w:Brassica napus, Ma: w:Musa spp. AAA, Ps: w:Pelargonium sidoides, Fs: w:Fucus serratus, 
Vc: w:Volvox carteri, Bcm: w:Brassica campestris, Np: w:Nicotiana plumbaginifolia, Na: w:Nicotiana alata, Os: 
w: Oryza sativa, Pp: w:Physcomitrella patens, Bcr: w:Brassica carinata, Pt: w:Populus trichocarpa, S|: w:Solanum 


lycopersicum. 


fl PLA: phytocyanin-like AGP. ALGAL-CAM: algal cell adhesion molecule. ISG: inversion-specific glycoprotein. 


FLA: fasciclin like AGP. ENODL: earlt nodulation like. MF: male fertility. TTS: transmitting tissue specific. MTR: 
microspore and tapetum regulator. SOS: salt overly sensitive. XYP: xylogen protein. attAGP: attachment AGP 
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